Overview

Dataset statistics

Number of variables: 33
Number of observations: 410706
Missing cells: 1731819
Missing cells (%): 12.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 735.1 MiB
Average record size in memory: 1.8 KiB
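
The percentage and per-record figures above follow from the raw counts. A minimal sketch of that arithmetic, assuming "missing cells (%)" is missing cells divided by variables times observations and that 1 MiB = 1024 KiB (the variable names are illustrative, not taken from the report):

```python
# Figures copied from the "Dataset statistics" block above.
n_variables = 33
n_observations = 410_706
missing_cells = 1_731_819
total_mib = 735.1

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size = total memory / number of rows.
avg_record_kib = total_mib * 1024 / n_observations

print(f"total cells:       {total_cells}")
print(f"missing cells (%): {missing_pct:.1f}")
print(f"avg record (KiB):  {avg_record_kib:.1f}")
```

Both derived values round to the figures the report states (12.8% and 1.8 KiB).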

Variable types

NUM: 18
CAT: 14
BOOL: 1

Reproduction

Analysis started: 2020-12-19 05:23:46.152888
Analysis finished: 2020-12-19 06:39:54.993037
Version: pandas-profiling v2.6.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]

Warnings

name has a high cardinality: 1565 distinct values (High cardinality)
review_text has a high cardinality: 96066 distinct values (High cardinality)
production_info_name has a high cardinality: 1565 distinct values (High cardinality)
production_info_brand_name has a high cardinality: 379 distinct values (High cardinality)
production_info_explain has a high cardinality: 4505 distinct values (High cardinality)
house_region has a high cardinality: 53 distinct values (High cardinality)
house_color_list has a high cardinality: 52 distinct values (High cardinality)
view_count is highly correlated with scrap_count (High correlation)
scrap_count is highly correlated with view_count (High correlation)
product_info_id is highly correlated with id (High correlation)
id is highly correlated with product_info_id (High correlation)
house_scrap_count is highly correlated with house_like_count (High correlation)
house_like_count is highly correlated with house_scrap_count (High correlation)
house_expertise is highly correlated with house_region and 2 other fields (High correlation)
house_region is highly correlated with house_expertise (High correlation)
house_color_list is highly correlated with house_expertise (High correlation)
house_constructions is highly correlated with house_expertise (High correlation)
production_info_explain has 10489 (2.6%) missing values (Missing)
house_residence has 96288 (23.4%) missing values (Missing)
house_area has 96288 (23.4%) missing values (Missing)
house_region has 166224 (40.5%) missing values (Missing)
house_expertise has 96288 (23.4%) missing values (Missing)
house_color_list has 180330 (43.9%) missing values (Missing)
house_style_list has 163586 (39.8%) missing values (Missing)
house_constructions has 344598 (83.9%) missing values (Missing)
house_family_list has 96288 (23.4%) missing values (Missing)
house_like_count has 96288 (23.4%) missing values (Missing)
house_reply_count has 96288 (23.4%) missing values (Missing)
house_scrap_count has 96288 (23.4%) missing values (Missing)
house_view_count has 96288 (23.4%) missing values (Missing)
house_share_count has 96288 (23.4%) missing values (Missing)
review_star_durability has 11489 (2.8%) zeros (Zeros)
review_star_design has 11489 (2.8%) zeros (Zeros)
review_star_cost has 11489 (2.8%) zeros (Zeros)
review_star_delivery has 11489 (2.8%) zeros (Zeros)

Variables

df_index
Real number (ℝ≥0)

UNIQUE
Distinct count: 410706
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 206061.6944
Minimum: 2
Maximum: 413661
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.1 MiB

Quantile statistics

Minimum: 2
5th percentile: 20567.25
Q1: 102904.25
Median: 205835.5
Q3: 308778.75
95th percentile: 392929.75
Maximum: 413661
Range: 413659
Interquartile range (IQR): 205874.5

Descriptive statistics

Standard deviation: 119183.7904
Coefficient of variation (CV): 0.578388869
Kurtosis: -1.193397637
Mean: 206061.6944
Median Absolute Deviation (MAD): 103144.7033
Skewness: 0.007492239736
Sum: 8.463077426e+10
Variance: 1.420477589e+10
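
Several of the df_index statistics are derived from others in the same block. A minimal sketch, assuming the usual definitions (range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative:

```python
# Values copied from the df_index statistics above.
minimum, maximum = 2, 413661
q1, q3 = 102904.25, 308778.75
mean, std = 206061.6944, 119183.7904

value_range = maximum - minimum   # reported as Range
iqr = q3 - q1                     # reported as Interquartile range (IQR)
cv = std / mean                   # reported as Coefficient of variation (CV)
variance = std ** 2               # reported as Variance

print(value_range, iqr, round(cv, 6), f"{variance:.6e}")
```

Under these definitions the results agree with the reported Range, IQR, CV and Variance to the precision shown.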
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.000000e+00 3.570595e+05 3.615195e+05 3.657795e+05 4.136610e+05], "bayesian blocks" binning strategy used)

Common values

Value                    Count     Frequency (%)
2047                     1         < 0.1%
230618                   1         < 0.1%
251088                   1         < 0.1%
253137                   1         < 0.1%
246994                   1         < 0.1%
249043                   1         < 0.1%
259284                   1         < 0.1%
261333                   1         < 0.1%
255190                   1         < 0.1%
257239                   1         < 0.1%
Other values (410696)    410696    > 99.9%

Minimum 5 values

Value     Count    Frequency (%)
2         1        < 0.1%
4         1        < 0.1%
5         1        < 0.1%
6         1        < 0.1%
7         1        < 0.1%

Maximum 5 values

Value     Count    Frequency (%)
413661    1        < 0.1%
413660    1        < 0.1%
413659    1        < 0.1%
413658    1        < 0.1%
413657    1        < 0.1%

id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1565
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean254435.5721
Minimum1666
Maximum605719
Zeros0
Zeros (%)0.0%
Memory size3.1 MiB

Quantile statistics

Minimum1666
5-th percentile32151
Q199687
median309648
Q3388715
95-th percentile388715
Maximum605719
Range604053
Interquartile range (IQR)289028

Descriptive statistics

Standard deviation147396.7359
Coefficient of variation (CV)0.5793086819
Kurtosis-1.378191839
Mean254435.5721
Median Absolute Deviation (MAD)134679.6247
Skewness-0.3802051181
Sum1.044982161e+11
Variance2.172579776e+10
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1666. 6500.5 7051. 7498. 9701.5 ... 591986. 592077. 593227.5 600047. 605719. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
388715 124975 30.4%
 
309648 44991 11.0%
 
32151 36549 8.9%
 
59910 17180 4.2%
 
111084 14112 3.4%
 
111274 8935 2.2%
 
161832 8271 2.0%
 
312751 5935 1.4%
 
208837 5486 1.3%
 
314450 5360 1.3%
 
Other values (1555) 138912 33.8%
 
ValueCountFrequency (%) 
1666 1 < 0.1%
 
3940 2 < 0.1%
 
4293 4 < 0.1%
 
6289 1 < 0.1%
 
6712 506 0.1%
 
ValueCountFrequency (%) 
605719 409 0.1%
 
594375 8 < 0.1%
 
592080 14 < 0.1%
 
592074 11 < 0.1%
 
591898 1 < 0.1%
 

name
Categorical

HIGH CARDINALITY
Distinct count1565
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.1 MiB
ValueCountFrequency (%) 
순수원목 A사이드테이블 3colors 124975 30.4%
 
[주말특가] 스테이 프리미엄 차렵이불(세트) 10colors 44991 11.0%
 
타미 1인 패브릭소파 3colors 36549 8.9%
 
전자레인지 20L 실속형 다이얼식 MEM-GP20W MEM-GP20B 2colors 17180 4.2%
 
빈백 607C 그랜드 소파 12colors 14112 3.4%
 
무선센서등 건전지 LED 동작감지 등 8935 2.2%
 
에스프레소 스팀 커피머신 2color 8271 2.0%
 
슈크림 세미워셔 차렵이불(세트) 10colors 5935 1.4%
 
유럽 안전인증! 혼요 오가닉코팅 디지털 에어프라이어 5486 1.3%
 
빅플러스 3D 거실 LED 벽시계-화이트 5360 1.3%
 
Other values (1555) 138912 33.8%
 

Length

Max length52
Mean length25.47160986
Min length3
ValueCountFrequency (%) 
Other_Letter 594 87.5%
 
Uppercase_Letter 28 4.1%
 
Lowercase_Letter 25 3.7%
 
Decimal_Number 10 1.5%
 
Other_Punctuation 8 1.2%
 
Math_Symbol 3 0.4%
 
Space_Separator 2 0.3%
 
Open_Punctuation 2 0.3%
 
Close_Punctuation 2 0.3%
 
Letter_Number 1 0.1%
 
Other values (4) 4 0.6%
 
ValueCountFrequency (%) 
Hangul 594 87.5%
 
Latin 54 8.0%
 
Common 31 4.6%
 
ValueCountFrequency (%) 
Hangul 594 88.0%
 
ASCII 80 11.9%
 
Number Forms 1 0.1%
 

review_count
Real number (ℝ≥0)

Distinct count225
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10442.80787
Minimum1
Maximum28574
Zeros0
Zeros (%)0.0%
Memory size3.1 MiB

Quantile statistics

Minimum1
5-th percentile68
Q1569
median4062
Q328574
95-th percentile28574
Maximum28574
Range28573
Interquartile range (IQR)28005

Descriptive statistics

Standard deviation12225.34765
Coefficient of variation (CV)1.170695449
Kurtosis-1.329885227
Mean10442.80787
Median Absolute Deviation (MAD)11034.39315
Skewness0.7393876716
Sum4288923848
Variance149459125.1
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000e+00 1.50000e+00 2.50000e+00 1.05000e+01 1.35000e+01 ... 3.08350e+03 3.74950e+03 6.22650e+03 1.84825e+04 2.85740e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
28574 124975 30.4%
 
8391 44991 11.0%
 
4062 36549 8.9%
 
3437 17180 4.2%
 
1569 14112 3.4%
 
1788 8935 2.2%
 
920 8271 2.0%
 
1188 5935 1.4%
 
423 5486 1.3%
 
1073 5360 1.3%
 
Other values (215) 138912 33.8%
 
ValueCountFrequency (%) 
1 1 < 0.1%
 
2 331 0.1%
 
3 456 0.1%
 
4 450 0.1%
 
5 528 0.1%
 
ValueCountFrequency (%) 
28574 124975 30.4%
 
8391 44991 11.0%
 
4062 36549 8.9%
 
3437 17180 4.2%
 
2730 2730 0.7%
 

review_average
Real number (ℝ≥0)

Distinct count128
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.550996771
Minimum2.12
Maximum5
Zeros0
Zeros (%)0.0%
Memory size3.1 MiB

Quantile statistics

Minimum2.12
5-th percentile4.44
Q14.53
median4.53
Q34.57
95-th percentile4.69
Maximum5
Range2.88
Interquartile range (IQR)0.04

Descriptive statistics

Standard deviation0.08787678319
Coefficient of variation (CV)0.01930934861
Kurtosis22.77102488
Mean4.550996771
Median Absolute Deviation (MAD)0.05589810281
Skewness-0.9292675288
Sum1869121.68
Variance0.007722329024
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.12 2.885 3.47 3.72 3.765 ... 4.945 4.955 4.975 4.99 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4.53 147395 35.9%
 
4.51 50159 12.2%
 
4.56 45370 11.0%
 
4.67 21150 5.1%
 
4.57 16655 4.1%
 
4.66 12853 3.1%
 
4.5 11626 2.8%
 
4.47 7863 1.9%
 
4.59 7389 1.8%
 
4.44 7165 1.7%
 
Other values (118) 83081 20.2%
 
ValueCountFrequency (%) 
2.12 1 < 0.1%
 
2.17 2 < 0.1%
 
2.38 1 < 0.1%
 
2.62 1 < 0.1%
 
2.75 1 < 0.1%
 
ValueCountFrequency (%) 
5 991 0.2%
 
4.98 28 < 0.1%
 
4.97 95 < 0.1%
 
4.96 69 < 0.1%
 
4.95 29 < 0.1%
 

scrap_count
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count911
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24490.62136
Minimum0
Maximum99468
Zeros6
Zeros (%)< 0.1%
Memory size3.1 MiB

Quantile statistics

Minimum0
5-th percentile899
Q18568
median20610
Q322294
95-th percentile99468
Maximum99468
Range99468
Interquartile range (IQR)13726

Descriptive statistics

Standard deviation25870.06628
Coefficient of variation (CV)1.056325436
Kurtosis3.520131085
Mean24490.62136
Median Absolute Deviation (MAD)16085.94076
Skewness2.096225453
Sum1.005844514e+10
Variance669260329.1
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 7.5000e+00 8.5000e+00 9.5000e+00 1.0500e+01 ... 4.1338e+04 5.2964e+04 6.7085e+04 8.4574e+04 9.9468e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
20610 124975 30.4%
 
22294 44991 11.0%
 
99468 36549 8.9%
 
17642 17180 4.2%
 
41438 14112 3.4%
 
14654 8935 2.2%
 
20500 8271 2.0%
 
22358 5935 1.4%
 
4776 5486 1.3%
 
5342 5360 1.3%
 
Other values (901) 138912 33.8%
 
ValueCountFrequency (%) 
0 6 < 0.1%
 
1 30 < 0.1%
 
2 36 < 0.1%
 
3 36 < 0.1%
 
4 25 < 0.1%
 
ValueCountFrequency (%) 
99468 36549 8.9%
 
69680 2730 0.7%
 
64490 3655 0.9%
 
41438 14112 3.4%
 
41238 348 0.1%
 

view_count
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1466
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean342980.1128
Minimum8
Maximum1685609
Zeros0
Zeros (%)0.0%
Memory size3.1 MiB

Quantile statistics

Minimum8
5-th percentile9912
Q1126401
median162635
Q3395346
95-th percentile1685609
Maximum1685609
Range1685601
Interquartile range (IQR)268945

Descriptive statistics

Standard deviation454056.2031
Coefficient of variation (CV)1.323855775
Kurtosis3.956333946
Mean342980.1128
Median Absolute Deviation (MAD)298103.8998
Skewness2.271232334
Sum1.408639902e+11
Variance2.061670356e+11
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[8.000000e+00 3.950000e+01 4.800000e+01 6.150000e+01 6.250000e+01 ... 7.630140e+05 8.467215e+05 9.955725e+05 1.399133e+06 1.685609e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
162635 124975 30.4%
 
395346 44991 11.0%
 
1685609 36549 8.9%
 
218606 17180 4.2%
 
546110 14112 3.4%
 
205943 8935 2.2%
 
283973 8271 2.0%
 
74681 5935 1.4%
 
53479 5486 1.3%
 
67275 5360 1.3%
 
Other values (1456) 138912 33.8%
 
ValueCountFrequency (%) 
8 3 < 0.1%
 
9 1 < 0.1%
 
11 2 < 0.1%
 
20 10 < 0.1%
 
23 1 < 0.1%
 
ValueCountFrequency (%) 
1685609 36549 8.9%
 
1112657 3655 0.9%
 
878488 2730 0.7%
 
814955 1936 0.5%
 
711073 1259 0.3%
 

review_user_id
Real number (ℝ≥0)

Distinct count85461
Unique (%)20.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5074734.379
Minimum793
Maximum11401724
Zeros0
Zeros (%)0.0%
Memory size3.1 MiB

Quantile statistics

Minimum793
5-th percentile942734
Q12565275
median4928460
Q37323703
95-th percentile10039852
Maximum11401724
Range11400931
Interquartile range (IQR)4758428

Descriptive statistics

Standard deviation2877081.777
Coefficient of variation (CV)0.5669423387
Kurtosis-1.017689084
Mean5074734.379
Median Absolute Deviation (MAD)2460118.507
Skewness0.2125056904
Sum2.084223858e+12
Variance8.27759955e+12
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[7.9300000e+02 1.8555500e+04 1.9327000e+04 5.6444500e+04 5.6481000e+04 ... 1.1279058e+07 1.1296349e+07 1.1333968e+07 1.1335588e+07 1.1401724e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2692433 2140 0.5%
 
1102169 59 < 0.1%
 
3929681 54 < 0.1%
 
2807693 54 < 0.1%
 
2183498 52 < 0.1%
 
3632293 51 < 0.1%
 
7472708 50 < 0.1%
 
3172776 50 < 0.1%
 
2851100 50 < 0.1%
 
1429879 50 < 0.1%
 
Other values (85451) 408096 99.4%
 
ValueCountFrequency (%) 
793 5 < 0.1%
 
3580 1 < 0.1%
 
3765 5 < 0.1%
 
5140 1 < 0.1%
 
8544 1 < 0.1%
 
ValueCountFrequency (%) 
11401724 5 < 0.1%
 
11392745 1 < 0.1%
 
11387410 1 < 0.1%
 
11385035 5 < 0.1%
 
11382685 5 < 0.1%
 

review_status
Categorical

CONSTANT
REJECTED
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.1 MiB
ValueCountFrequency (%) 
확인완료 410706 100.0%
 

Length

Max length4
Mean length4
Min length4
ValueCountFrequency (%) 
Other_Letter 4 100.0%
 
ValueCountFrequency (%) 
Hangul 4 100.0%
 
ValueCountFrequency (%) 
Hangul 4 100.0%
 

review_text
Categorical

HIGH CARDINALITY
Distinct count96066
Unique (%)23.4%
Missing0
Missing (%)0.0%
Memory size3.1 MiB
ValueCountFrequency (%) 
깔끔한 인테리어에 아주 적합한,,,, 50 < 0.1%
 
너무이뻐요 잘쓸게요 ㅎㅎ지인에게 추천해줬어요 50 < 0.1%
 
좋아요~~~~~~~~~~~~~~~~~ 49 < 0.1%
 
좋아요좋아요좋아요좋아요좋아요좋아요좋아요 45 < 0.1%
 
좋아여 잘 쓸 것 같습니다 핳핳핳핳하 38 < 0.1%
 
리뷰보고 구매했는데 전체적으로 만족합니다~^^ 34 < 0.1%
 
.................... 32 < 0.1%
 
깨끗하고 이쁘고 배송빠르고 짱짱이에요 31 < 0.1%
 
디자인 좋구요... 튼튼해요... 맘에 듭니다. 30 < 0.1%
 
배송 도 빠르고 제품도 좋아요 잘쓰고있어요>•< 30 < 0.1%
 
Other values (96056) 410317 99.9%
 

Length

Max length3171
Mean length63.07105082
Min length9
ValueCountFrequency (%) 
Other_Letter 2435 82.1%
 
Other_Symbol 256 8.6%
 
Nonspacing_Mark 52 1.8%
 
Lowercase_Letter 43 1.4%
 
Other_Punctuation 31 1.0%
 
Uppercase_Letter 27 0.9%
 
Math_Symbol 21 0.7%
 
Modifier_Symbol 21 0.7%
 
Decimal_Number 15 0.5%
 
Modifier_Letter 13 0.4%
 
Other values (13) 52 1.8%
 
ValueCountFrequency (%) 
Hangul 2356 79.4%
 
Common 387 13.0%
 
Latin 67 2.3%
 
Inherited 40 1.3%
 
Han 24 0.8%
 
Hiragana 13 0.4%
 
Katakana 13 0.4%
 
Tibetan 8 0.3%
 
Canadian_Aboriginal 7 0.2%
 
Arabic 5 0.2%
 
Other values (22) 46 1.6%
 
ValueCountFrequency (%) 
Hangul 2282 82.1%
 
ASCII 96 3.5%
 
Emoticons 61 2.2%
 
Compat Jamo 41 1.5%
 
Jamo 32 1.2%
 
Diacriticals 28 1.0%
 
Dingbats 25 0.9%
 
CJK 23 0.8%
 
Punctuation 21 0.8%
 
Misc Symbols 16 0.6%
 
Other values (43) 156 5.6%
 

review_star_durability
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.363817427
Minimum0
Maximum5
Zeros11489
Zeros (%)2.8%
Memory size3.1 MiB

Quantile statistics

Minimum0
5-th percentile2
Q14
median5
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.111470942
Coefficient of variation (CV)0.2547015223
Kurtosis5.391002171
Mean4.363817427
Median Absolute Deviation (MAD)0.8159220491
Skewness-2.280979502
Sum1792246
Variance1.235367654
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 263371 64.1%
 
4 85593 20.8%
 
3 38046 9.3%
 
0 11489 2.8%
 
2 6674 1.6%
 
1 5533 1.3%
 
ValueCountFrequency (%) 
0 11489 2.8%
 
1 5533 1.3%
 
2 6674 1.6%
 
3 38046 9.3%
 
4 85593 20.8%
 
ValueCountFrequency (%) 
5 263371 64.1%
 
4 85593 20.8%
 
3 38046 9.3%
 
2 6674 1.6%
 
1 5533 1.3%
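
The Zeros flag on this variable matters for interpretation. If the 11489 zeros stand for "no rating given" rather than a real zero-star score (an assumption; the report does not say), they pull the mean down. A small sketch that recomputes the mean from the value counts above, with and without the zeros:

```python
# Value counts for review_star_durability, copied from the table above.
counts = {5: 263371, 4: 85593, 3: 38046, 0: 11489, 2: 6674, 1: 5533}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean_all = total / n

# Assumption: 0 encodes "not rated", so it is excluded here.
rated = {value: count for value, count in counts.items() if value != 0}
n_rated = sum(rated.values())
mean_rated = sum(value * count for value, count in rated.items()) / n_rated

print(f"rows: {n}, sum: {total}")
print(f"mean including zeros: {mean_all:.4f}")
print(f"mean excluding zeros: {mean_rated:.4f}")
```

The first mean reproduces the reported Mean and Sum; the second is only meaningful if the not-rated reading of zero is correct.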
 

review_star_design
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.576660677
Minimum0
Maximum5
Zeros11489
Zeros (%)2.8%
Memory size3.1 MiB

Quantile statistics

Minimum0
5-th percentile3
Q15
median5
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.000099627
Coefficient of variation (CV)0.2185216903
Kurtosis10.83920677
Mean4.576660677
Median Absolute Deviation (MAD)0.6459855479
Skewness-3.20381897
Sum1879662
Variance1.000199264
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 313354 76.3%
 
4 62544 15.2%
 
3 18649 4.5%
 
0 11489 2.8%
 
1 2571 0.6%
 
2 2099 0.5%
 
ValueCountFrequency (%) 
0 11489 2.8%
 
1 2571 0.6%
 
2 2099 0.5%
 
3 18649 4.5%
 
4 62544 15.2%
 
ValueCountFrequency (%) 
5 313354 76.3%
 
4 62544 15.2%
 
3 18649 4.5%
 
2 2099 0.5%
 
1 2571 0.6%
 

review_star_cost
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.421637376
Minimum0
Maximum5
Zeros11489
Zeros (%)2.8%
Memory size3.1 MiB

Quantile statistics

Minimum0
5-th percentile3
Q14
median5
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.078928777
Coefficient of variation (CV)0.2440111402
Kurtosis6.393939214
Mean4.421637376
Median Absolute Deviation (MAD)0.7817343148
Skewness-2.453567692
Sum1815993
Variance1.164087307
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 277562 67.6%
 
4 75520 18.4%
 
3 37175 9.1%
 
0 11489 2.8%
 
2 5618 1.4%
 
1 3342 0.8%
 
ValueCountFrequency (%) 
0 11489 2.8%
 
1 3342 0.8%
 
2 5618 1.4%
 
3 37175 9.1%
 
4 75520 18.4%
 
ValueCountFrequency (%) 
5 277562 67.6%
 
4 75520 18.4%
 
3 37175 9.1%
 
2 5618 1.4%
 
1 3342 0.8%
 

review_star_delivery
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.380369413
Minimum0
Maximum5
Zeros11489
Zeros (%)2.8%
Memory size3.1 MiB

Quantile statistics

Minimum0
5-th percentile1
Q14
median5
Q35
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.167344278
Coefficient of variation (CV)0.2664944821
Kurtosis4.749773133
Mean4.380369413
Median Absolute Deviation (MAD)0.8494804556
Skewness-2.256251325
Sum1799044
Variance1.362692663
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 281528 68.5%
 
4 66104 16.1%
 
3 33242 8.1%
 
0 11489 2.8%
 
1 9424 2.3%
 
2 8919 2.2%
 
ValueCountFrequency (%) 
0 11489 2.8%
 
1 9424 2.3%
 
2 8919 2.2%
 
3 33242 8.1%
 
4 66104 16.1%
 
ValueCountFrequency (%) 
5 281528 68.5%
 
4 66104 16.1%
 
3 33242 8.1%
 
2 8919 2.2%
 
1 9424 2.3%
 

review_star_avg
Real number (ℝ≥0)

Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.569233101
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size3.1 MiB

Quantile statistics

Minimum1
5-th percentile3.25
Q14.25
median5
Q35
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)0.75

Descriptive statistics

Standard deviation0.6297288686
Coefficient of variation (CV)0.137819379
Kurtosis5.504838672
Mean4.569233101
Median Absolute Deviation (MAD)0.4817371702
Skewness-2.029103515
Sum1876611.45
Variance0.3965584479
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 1.125 1.375 1.875 2.375 ... 4.375 4.625 4.825 4.95 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
5 210849 51.3%
 
4.75 44782 10.9%
 
4.5 40227 9.8%
 
4 37505 9.1%
 
4.25 27961 6.8%
 
3.75 15230 3.7%
 
3.5 11431 2.8%
 
3 7938 1.9%
 
3.25 6798 1.7%
 
2.75 2350 0.6%
 
Other values (8) 5635 1.4%
 
ValueCountFrequency (%) 
1 1553 0.4%
 
1.25 254 0.1%
 
1.5 385 0.1%
 
1.75 362 0.1%
 
2 710 0.2%
 
ValueCountFrequency (%) 
5 210849 51.3%
 
4.9 13 < 0.1%
 
4.75 44782 10.9%
 
4.5 40227 9.8%
 
4.25 27961 6.8%
 

product_info_id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1565
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean254435.5721
Minimum1666
Maximum605719
Zeros0
Zeros (%)0.0%
Memory size3.1 MiB

Quantile statistics

Minimum1666
5-th percentile32151
Q199687
median309648
Q3388715
95-th percentile388715
Maximum605719
Range604053
Interquartile range (IQR)289028

Descriptive statistics

Standard deviation147396.7359
Coefficient of variation (CV)0.5793086819
Kurtosis-1.378191839
Mean254435.5721
Median Absolute Deviation (MAD)134679.6247
Skewness-0.3802051181
Sum1.044982161e+11
Variance2.172579776e+10
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1666. 6500.5 7051. 7498. 9701.5 ... 591986. 592077. 593227.5 600047. 605719. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
388715 124975 30.4%
 
309648 44991 11.0%
 
32151 36549 8.9%
 
59910 17180 4.2%
 
111084 14112 3.4%
 
111274 8935 2.2%
 
161832 8271 2.0%
 
312751 5935 1.4%
 
208837 5486 1.3%
 
314450 5360 1.3%
 
Other values (1555) 138912 33.8%
 
ValueCountFrequency (%) 
1666 1 < 0.1%
 
3940 2 < 0.1%
 
4293 4 < 0.1%
 
6289 1 < 0.1%
 
6712 506 0.1%
 
ValueCountFrequency (%) 
605719 409 0.1%
 
594375 8 < 0.1%
 
592080 14 < 0.1%
 
592074 11 < 0.1%
 
591898 1 < 0.1%
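
Every statistic reported for product_info_id matches the one reported for id, which suggests the two columns carry the same values. A minimal sketch for detecting such duplicates, assuming the table is held as a plain dict of column name to list of values (`duplicate_columns` and the three-row sample are hypothetical):

```python
from itertools import combinations


def duplicate_columns(table):
    """Return pairs of column names whose value sequences are identical."""
    return [
        (a, b)
        for a, b in combinations(table, 2)
        if table[a] == table[b]
    ]


# Hypothetical three-row sample; the numbers are borrowed from the
# frequency tables in this report purely for illustration.
sample = {
    "id": [388715, 309648, 32151],
    "product_info_id": [388715, 309648, 32151],
    "review_count": [28574, 8391, 4062],
}

print(duplicate_columns(sample))
```

If the columns do prove identical on the full data, dropping one of them would also clear the matching High correlation warnings.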
 

production_info_name
Categorical

HIGH CARDINALITY
Distinct count1565
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.1 MiB
ValueCountFrequency (%) 
순수원목 A사이드테이블 3colors 124975 30.4%
 
[주말특가] 스테이 프리미엄 차렵이불(세트) 10colors 44991 11.0%
 
타미 1인 패브릭소파 3colors 36549 8.9%
 
전자레인지 20L 실속형 다이얼식 MEM-GP20W MEM-GP20B 2colors 17180 4.2%
 
빈백 607C 그랜드 소파 12colors 14112 3.4%
 
무선센서등 건전지 LED 동작감지 등 8935 2.2%
 
에스프레소 스팀 커피머신 2color 8271 2.0%
 
슈크림 세미워셔 차렵이불(세트) 10colors 5935 1.4%
 
유럽 안전인증! 혼요 오가닉코팅 디지털 에어프라이어 5486 1.3%
 
빅플러스 3D 거실 LED 벽시계-화이트 5360 1.3%
 
Other values (1555) 138912 33.8%
 

Length

Max length52
Mean length25.47160986
Min length3
ValueCountFrequency (%) 
Other_Letter 594 87.5%
 
Uppercase_Letter 28 4.1%
 
Lowercase_Letter 25 3.7%
 
Decimal_Number 10 1.5%
 
Other_Punctuation 8 1.2%
 
Math_Symbol 3 0.4%
 
Space_Separator 2 0.3%
 
Open_Punctuation 2 0.3%
 
Close_Punctuation 2 0.3%
 
Letter_Number 1 0.1%
 
Other values (4) 4 0.6%
 
ValueCountFrequency (%) 
Hangul 594 87.5%
 
Latin 54 8.0%
 
Common 31 4.6%
 
ValueCountFrequency (%) 
Hangul 594 88.0%
 
ASCII 80 11.9%
 
Number Forms 1 0.1%
 

production_info_brand_name
Categorical

HIGH CARDINALITY
Distinct count379
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size3.1 MiB
ValueCountFrequency (%) 
먼데이하우스 132989 32.4%
 
마틸라 52806 12.9%
 
보니애가구 46053 11.2%
 
매직쉐프 17180 4.2%
 
폴리몰리 14895 3.6%
 
에보니아 13966 3.4%
 
버즈가구 9846 2.4%
 
데이홈 8935 2.2%
 
모리츠 8271 2.0%
 
듀커소파 7300 1.8%
 
Other values (369) 98465 24.0%
 

Length

Max length13
Mean length4.546972774
Min length2
ValueCountFrequency (%) 
Other_Letter 304 86.9%
 
Lowercase_Letter 21 6.0%
 
Uppercase_Letter 18 5.1%
 
Decimal_Number 3 0.9%
 
Other_Punctuation 1 0.3%
 
Space_Separator 1 0.3%
 
Open_Punctuation 1 0.3%
 
Close_Punctuation 1 0.3%
 
ValueCountFrequency (%) 
Hangul 304 86.9%
 
Latin 39 11.1%
 
Common 7 2.0%
 
ValueCountFrequency (%) 
Hangul 304 86.9%
 
ASCII 46 13.1%
 

production_info_explain
Categorical

HIGH CARDINALITY
MISSING
Distinct count4505
Unique (%)1.1%
Missing10489
Missing (%)2.6%
Memory size3.1 MiB
ValueCountFrequency (%) 
상품명: A사이드테이블 / 색상: 우드 95150 23.2%
 
아이보리 29451 7.2%
 
상품명: A사이드테이블 / 색상: 화이트 22225 5.4%
 
그레이 13802 3.4%
 
매직쉐프 전자레인지 다이얼식 화이트 MEM-GP20W 9260 2.3%
 
상품명: A사이드테이블 / 색상: 블랙 6950 1.7%
 
건전지용 LED 무선센서등 [건전지미포함] 6710 1.6%
 
화이트 5650 1.4%
 
빅플러스 화이트 5330 1.3%
 
혼요 에어프라이어 K0014 5161 1.3%
 
Other values (4495) 200528 48.8%
 
(Missing) 10489 2.6%
 

Length

Max length73
Mean length19.71518556
Min length1
ValueCountFrequency (%) 
Other_Letter 627 87.2%
 
Uppercase_Letter 27 3.8%
 
Lowercase_Letter 24 3.3%
 
Decimal_Number 10 1.4%
 
Other_Punctuation 8 1.1%
 
Other_Symbol 7 1.0%
 
Math_Symbol 3 0.4%
 
Space_Separator 2 0.3%
 
Other_Number 2 0.3%
 
Open_Punctuation 2 0.3%
 
Other values (6) 7 1.0%
 
ValueCountFrequency (%) 
Hangul 626 87.1%
 
Latin 52 7.2%
 
Common 40 5.6%
 
Han 1 0.1%
 
ValueCountFrequency (%) 
Hangul 625 87.5%
 
ASCII 78 10.9%
 
Enclosed Alphanum 6 0.8%
 
Number Forms 1 0.1%
 
CJK 1 0.1%
 
Geometric Shapes 1 0.1%
 
Compat Jamo 1 0.1%
 
Misc Symbols 1 0.1%
 

(variable name missing in source)
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size401.2 KiB
ValueCountFrequency (%) 
True 399213 97.2%
 
False 11493 2.8%
 

house_residence
Categorical

MISSING
Distinct count6
Unique (%)< 0.1%
Missing96288
Missing (%)23.4%
Memory size3.1 MiB
ValueCountFrequency (%) 
원룸&오피스텔 153596 37.4%
 
아파트 106742 26.0%
 
빌라&연립 48880 11.9%
 
단독주택 3088 0.8%
 
상업공간 2056 0.5%
 
기타 56 < 0.1%
 
(Missing) 96288 23.4%
 

Length

Max length7
Mean length4.74633923
Min length2
ValueCountFrequency (%) 
Other_Letter 23 88.5%
 
Lowercase_Letter 2 7.7%
 
Other_Punctuation 1 3.8%
 
ValueCountFrequency (%) 
Hangul 23 88.5%
 
Latin 2 7.7%
 
Common 1 3.8%
 
ValueCountFrequency (%) 
Hangul 23 88.5%
 
ASCII 3 11.5%
 

house_area
Categorical

MISSING
Distinct count41
Unique (%)< 0.1%
Missing96288
Missing (%)23.4%
Memory size3.1 MiB
ValueCountFrequency (%) 
32평 50604 12.3%
 
5평 41532 10.1%
 
8평 38908 9.5%
 
17평 37552 9.1%
 
9평 27620 6.7%
 
7평 19996 4.9%
 
10평 17748 4.3%
 
25평 17076 4.2%
 
34평 7644 1.9%
 
26평 7636 1.9%
 
Other values (31) 48102 11.7%
 
(Missing) 96288 23.4%
 

Length

Max length3
Mean length2.665415163
Min length2
ValueCountFrequency (%) 
Decimal_Number 10 76.9%
 
Lowercase_Letter 2 15.4%
 
Other_Letter 1 7.7%
 
ValueCountFrequency (%) 
Common 10 76.9%
 
Latin 2 15.4%
 
Hangul 1 7.7%
 
ValueCountFrequency (%) 
ASCII 12 92.3%
 
Hangul 1 7.7%
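
house_area is profiled as categorical because each value is a number followed by 평 (pyeong, a Korean unit of floor area). A minimal sketch of turning such strings into numbers, assuming every valid value is a whole number directly followed by 평; `parse_pyeong` and `PYEONG_TO_M2` are hypothetical names:

```python
import re

# 1 pyeong is 400/121 square metres, roughly 3.3058.
PYEONG_TO_M2 = 400 / 121


def parse_pyeong(value):
    """Return the integer pyeong count in a string like '32평', else None."""
    if not isinstance(value, str):
        return None
    match = re.fullmatch(r"\s*(\d+)\s*평\s*", value)
    return int(match.group(1)) if match else None


for raw in ["32평", "5평", "nan", None]:
    pyeong = parse_pyeong(raw)
    square_metres = None if pyeong is None else round(pyeong * PYEONG_TO_M2, 1)
    print(raw, pyeong, square_metres)
```

Parsed this way, the column could be profiled as numeric, with unparseable cells treated as missing.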
 

house_region
Categorical

HIGH CARDINALITY
HIGH CORRELATION
MISSING
Distinct count53
Unique (%)< 0.1%
Missing166224
Missing (%)40.5%
Memory size3.1 MiB
ValueCountFrequency (%) 
부산광역시 수영구 35608 8.7%
 
경기도 25358 6.2%
 
부산광역시 21976 5.4%
 
서울특별시 서초구 21644 5.3%
 
충청남도 20100 4.9%
 
서울특별시 은평구 19996 4.9%
 
경상남도 김해시 19996 4.9%
 
서울특별시 13068 3.2%
 
서울특별시 종로구 9360 2.3%
 
경기도 수원시 6368 1.6%
 
Other values (43) 51008 12.4%
 
(Missing) 166224 40.5%
 

Length

Max length11
Mean length5.365429285
Min length3
ValueCountFrequency (%) 
Other_Letter 64 95.5%
 
Lowercase_Letter 2 3.0%
 
Space_Separator 1 1.5%
 
ValueCountFrequency (%) 
Hangul 64 95.5%
 
Latin 2 3.0%
 
Common 1 1.5%
 
ValueCountFrequency (%) 
Hangul 64 95.5%
 
ASCII 3 4.5%
 

house_expertise
Categorical

HIGH CORRELATION
MISSING
Distinct count4
Unique (%)< 0.1%
Missing96288
Missing (%)23.4%
Memory size3.1 MiB
ValueCountFrequency (%) 
홈스타일링 276018 67.2%
 
리모델링 24804 6.0%
 
부분공사 13592 3.3%
 
건축 4 < 0.1%
 
(Missing) 96288 23.4%
 

Length

Max length5
Mean length4.437592828
Min length2
ValueCountFrequency (%) 
Other_Letter 14 87.5%
 
Lowercase_Letter 2 12.5%
 
ValueCountFrequency (%) 
Hangul 14 87.5%
 
Latin 2 12.5%
 
ValueCountFrequency (%) 
Hangul 14 87.5%
 
ASCII 2 12.5%
 

house_color_list
Categorical

HIGH CARDINALITY
HIGH CORRELATION
MISSING
Distinct count52
Unique (%)< 0.1%
Missing180330
Missing (%)43.9%
Memory size3.1 MiB
ValueCountFrequency (%) 
화이트,그레이,베이지,민트,블루 35428 8.6%
 
화이트,베이지,블루 26080 6.4%
 
베이지,라이트 브라운 25076 6.1%
 
화이트,베이지,라이트 브라운,브라운 22180 5.4%
 
화이트,베이지,브라운,그린 20100 4.9%
 
화이트,베이지,라이트 브라운,브라운,그린 19996 4.9%
 
화이트,베이지 9692 2.4%
 
브라운 7260 1.8%
 
라이트 브라운 6284 1.5%
 
베이지,라이트 브라운,브라운 4976 1.2%
 
Other values (42) 53304 13.0%
 
(Missing) 180330 43.9%
 

Length

Max length29
Mean length9.1571343
Min length3
ValueCountFrequency (%) 
Other_Letter 23 85.2%
 
Lowercase_Letter 2 7.4%
 
Other_Punctuation 1 3.7%
 
Space_Separator 1 3.7%
 
ValueCountFrequency (%) 
Hangul 23 85.2%
 
Common 2 7.4%
 
Latin 2 7.4%
 
ValueCountFrequency (%) 
Hangul 23 85.2%
 
ASCII 4 14.8%
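
Each house_color_list value is a comma-separated combination of colours, so its 52 distinct values are combinations rather than individual colours. A minimal sketch of counting individual colours instead, using only the first three rows of the frequency table above (so the totals are partial):

```python
from collections import Counter

# First three rows of the house_color_list frequency table (partial data).
combination_counts = {
    "화이트,그레이,베이지,민트,블루": 35428,
    "화이트,베이지,블루": 26080,
    "베이지,라이트 브라운": 25076,
}

per_colour = Counter()
for combination, row_count in combination_counts.items():
    for colour in combination.split(","):
        # Each row of the combination contributes once to every colour in it.
        per_colour[colour.strip()] += row_count

for colour, total in per_colour.most_common():
    print(colour, total)
```

The same split-and-count approach would likely suit house_style_list and house_constructions, which use the same comma-separated layout.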
 

house_style_list
Categorical

MISSING
Distinct count32
Unique (%)< 0.1%
Missing163586
Missing (%)39.8%
Memory size3.1 MiB
ValueCountFrequency (%) 
내추럴 59152 14.4%
 
미니멀&심플 47576 11.6%
 
미니멀&심플,내추럴 36292 8.8%
 
모던,미니멀&심플,내추럴 35812 8.7%
 
미니멀&심플,내추럴,클래식&앤틱 21052 5.1%
 
내추럴,빈티지&레트로 8524 2.1%
 
북유럽,클래식&앤틱 7148 1.7%
 
모던,내추럴 4572 1.1%
 
내추럴,북유럽 3864 0.9%
 
모던,내추럴,유니크&믹스매치 3208 0.8%
 
Other values (22) 19920 4.9%
 
(Missing) 163586 39.8%
 

Length

Max length24
Mean length6.415192376
Min length2
ValueCountFrequency (%) 
Other_Letter 40 90.9%
 
Lowercase_Letter 2 4.5%
 
Other_Punctuation 2 4.5%
 
ValueCountFrequency (%) 
Hangul 40 90.9%
 
Common 2 4.5%
 
Latin 2 4.5%
 
ValueCountFrequency (%) 
Hangul 40 90.9%
 
ASCII 4 9.1%
 

house_constructions
Categorical

HIGH CORRELATION
MISSING
Distinct count36
Unique (%)0.1%
Missing344598
Missing (%)83.9%
Memory size3.1 MiB
ValueCountFrequency (%) 
원목마루,주방리모델링 20288 4.9%
 
조명시공 11972 2.9%
 
주방리모델링,폴딩도어,중문,발코니확장 6312 1.5%
 
헤링본 마루,주방리모델링,가벽&파티션,발코니확장 6272 1.5%
 
주방리모델링,조명시공 3780 0.9%
 
폴딩도어 3252 0.8%
 
가벽&파티션 2420 0.6%
 
포세린타일,주방리모델링,조명시공 2064 0.5%
 
주방리모델링,조명시공,중문 2040 0.5%
 
주방리모델링,조명시공,가벽&파티션 1504 0.4%
 
Other values (26) 6204 1.5%
 
(Missing) 344598 83.9%
 

Length

Max length44
Mean length4.595165398
Min length2
ValueCountFrequency (%) 
Other_Letter 44 89.8%
 
Lowercase_Letter 2 4.1%
 
Other_Punctuation 2 4.1%
 
Space_Separator 1 2.0%
 
ValueCountFrequency (%) 
Hangul 44 89.8%
 
Common 3 6.1%
 
Latin 2 4.1%
 
ValueCountFrequency (%) 
Hangul 44 89.8%
 
ASCII 5 10.2%
 

house_family_list
Categorical

MISSING
Distinct count28
Unique (%)< 0.1%
Missing96288
Missing (%)23.4%
Memory size3.1 MiB
ValueCountFrequency (%) 
[ 78604 19.1%
 
싱글라이프 46625 11.4%
 
['싱글라이프'] 46625 11.4%
 
'싱글라이프' 46625 11.4%
 
부모님과 함께 사는 집 12342 3.0%
 
'부모님과 함께 사는 집' 12342 3.0%
 
['부모님과 함께 사는 집'] 12341 3.0%
 
신혼부부 9429 2.3%
 
['신혼부부'] 9429 2.3%
 
'신혼부부' 9429 2.3%
 
Other values (18) 30627 7.5%
 
(Missing) 96288 23.4%
 

Length

Max length27
Mean length5.978602699
Min length1
ValueCountFrequency (%) 
Other_Letter 25 78.1%
 
Lowercase_Letter 2 6.2%
 
Other_Punctuation 2 6.2%
 
Open_Punctuation 1 3.1%
 
Space_Separator 1 3.1%
 
Close_Punctuation 1 3.1%
 
ValueCountFrequency (%) 
Hangul 25 78.1%
 
Common 5 15.6%
 
Latin 2 6.2%
 
ValueCountFrequency (%) 
Hangul 25 78.1%
 
ASCII 7 21.9%
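
The frequency table for house_family_list shows the same label in several forms (`싱글라이프`, `'싱글라이프'`, `['싱글라이프']`) plus a bare `[`. That pattern suggests the column holds Python-style list literals stored as text, though the report does not confirm this. A minimal sketch of normalising such cells under that assumption (`parse_list_cell` is a hypothetical helper):

```python
import ast


def parse_list_cell(cell):
    """Turn a cell such as "['싱글라이프']" into a list of label strings."""
    if cell is None:
        return []
    text = cell.strip()
    try:
        parsed = ast.literal_eval(text)
    except (ValueError, SyntaxError):
        # Not a valid Python literal: strip stray brackets and quotes
        # and treat whatever remains as a single bare label.
        label = text.strip("[]'\" ")
        return [label] if label else []
    if isinstance(parsed, (list, tuple)):
        return [str(item) for item in parsed]
    return [str(parsed)]


for raw in ["['싱글라이프']", "'싱글라이프'", "싱글라이프", "[", None]:
    print(repr(raw), parse_list_cell(raw))
```

After this normalisation, the three spellings of each label collapse into one and the stray `[` yields an empty list.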
 

house_like_count
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
Distinct count: 115
Unique (%): < 0.1%
Missing: 96288
Missing (%): 23.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 252.2540567
Minimum: 14
Maximum: 1394
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.1 MiB

Quantile statistics

Minimum: 14
5-th percentile: 56
Q1: 151
Median: 245
Q3: 294
95-th percentile: 482
Maximum: 1394
Range: 1380
Interquartile range (IQR): 143

Descriptive statistics

Standard deviation: 155.7630033
Coefficient of variation (CV): 0.6174846317
Kurtosis: 9.242650166
Mean: 252.2540567
Median Absolute Deviation (MAD): 105.1223028
Skewness: 2.129639094
Sum: 79313216
Variance: 24262.1132
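
Several of the figures in this block are derived from the others by simple arithmetic. The following is a minimal sanity-check sketch, not part of the report itself: it copies the values reported for house_like_count, assumes the number of non-missing rows is the 410706 observations from the overview minus the 96288 missing ones, and uses illustrative variable names.

```python
# Values copied from the house_like_count block of the report.
n_observations = 410_706   # "Number of observations" in the dataset overview
n_missing = 96_288         # "Missing" for this variable
q1, q3 = 151, 294
minimum, maximum = 14, 1394
mean = 252.2540567
std = 155.7630033
total = 79_313_216         # reported "Sum"

# Assumption: statistics are computed over the non-missing rows only.
n_valid = n_observations - n_missing

iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum    # range
cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance is the squared standard deviation
mean_from_sum = total / n_valid    # mean recovered from the sum

print(iqr, value_range)            # 143 1380
print(round(cv, 4))                # 0.6175
print(round(variance, 1))          # 24262.1
print(round(mean_from_sum, 3))     # 252.254
```

The recomputed values agree with the reported IQR, range, CV, variance and mean to the printed precision, which suggests the statistics were indeed taken over non-missing rows.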
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
293 | 35428 | 8.6%
242 | 26080 | 6.4%
264 | 22912 | 5.6%
169 | 22116 | 5.4%
245 | 22052 | 5.4%
344 | 20112 | 4.9%
418 | 20004 | 4.9%
82 | 19996 | 4.9%
85 | 16668 | 4.1%
56 | 16244 | 4.0%
Other values (105) | 92806 | 22.6%
(Missing) | 96288 | 23.4%

Value | Count | Frequency (%)
14 | 6 | < 0.1%
35 | 36 | < 0.1%
41 | 24 | < 0.1%
42 | 2924 | 0.7%
43 | 56 | < 0.1%

Value | Count | Frequency (%)
1394 | 8 | < 0.1%
1227 | 1688 | 0.4%
1217 | 8 | < 0.1%
1080 | 20 | < 0.1%
907 | 1816 | 0.4%

house_reply_count
Real number (ℝ≥0)

MISSING
Distinct count: 75
Unique (%): < 0.1%
Missing: 96288
Missing (%): 23.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 43.56021602
Minimum: 2
Maximum: 213
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.1 MiB

Quantile statistics

Minimum: 2
5-th percentile: 10
Q1: 23
Median: 39
Q3: 67
95-th percentile: 85
Maximum: 213
Range: 211
Interquartile range (IQR): 44

Descriptive statistics

Standard deviation: 29.37903799
Coefficient of variation (CV): 0.6744465632
Kurtosis: 3.45986883
Mean: 43.56021602
Median Absolute Deviation (MAD): 23.31527187
Skewness: 1.276659761
Sum: 13696116
Variance: 863.127873
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
42 | 37592 | 9.2%
10 | 32912 | 8.0%
11 | 29400 | 7.2%
74 | 26080 | 6.4%
59 | 23780 | 5.8%
23 | 23116 | 5.6%
32 | 20100 | 4.9%
28 | 20044 | 4.9%
77 | 19996 | 4.9%
39 | 10080 | 2.5%
Other values (65) | 71318 | 17.4%
(Missing) | 96288 | 23.4%

Value | Count | Frequency (%)
2 | 30 | < 0.1%
4 | 108 | < 0.1%
6 | 2924 | 0.7%
8 | 40 | < 0.1%
10 | 32912 | 8.0%

Value | Count | Frequency (%)
213 | 420 | 0.1%
187 | 1816 | 0.4%
183 | 8 | < 0.1%
159 | 8 | < 0.1%
140 | 20 | < 0.1%

house_scrap_count
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
Distinct count: 118
Unique (%): < 0.1%
Missing: 96288
Missing (%): 23.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 755.3335369
Minimum: 184
Maximum: 4172
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.1 MiB

Quantile statistics

Minimum: 184
5-th percentile: 293
Q1: 548
Median: 750
Q3: 803
95-th percentile: 1379
Maximum: 4172
Range: 3988
Interquartile range (IQR): 255

Descriptive statistics

Standard deviation: 369.4283374
Coefficient of variation (CV): 0.4890929892
Kurtosis: 13.48141254
Mean: 755.3335369
Median Absolute Deviation (MAD): 231.1611218
Skewness: 2.654115022
Sum: 237490460
Variance: 136477.2965
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
752 | 35428 | 8.6%
776 | 26080 | 6.4%
750 | 21536 | 5.2%
854 | 20100 | 4.9%
599 | 19996 | 4.9%
548 | 19996 | 4.9%
293 | 19996 | 4.9%
1379 | 19996 | 4.9%
459 | 16892 | 4.1%
406 | 16244 | 4.0%
Other values (108) | 98154 | 23.9%
(Missing) | 96288 | 23.4%

Value | Count | Frequency (%)
184 | 6 | < 0.1%
215 | 16 | < 0.1%
257 | 32 | < 0.1%
281 | 624 | 0.2%
293 | 19996 | 4.9%

Value | Count | Frequency (%)
4172 | 8 | < 0.1%
3381 | 1688 | 0.4%
3343 | 8 | < 0.1%
2494 | 168 | < 0.1%
2489 | 20 | < 0.1%

house_view_count
Real number (ℝ≥0)

MISSING
Distinct count: 123
Unique (%): < 0.1%
Missing: 96288
Missing (%): 23.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 35156.35429
Minimum: 6311
Maximum: 105830
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.1 MiB

Quantile statistics

Minimum: 6311
5-th percentile: 8092
Q1: 29964
Median: 34193
Q3: 38019
95-th percentile: 54147
Maximum: 105830
Range: 99519
Interquartile range (IQR): 8055

Descriptive statistics

Standard deviation: 12901.21253
Coefficient of variation (CV): 0.3669667345
Kurtosis: 2.568506988
Mean: 35156.35429
Median Absolute Deviation (MAD): 9224.927387
Skewness: 0.6380943408
Sum: 1.10537906e+10
Variance: 166441284.7
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
38019 | 35428 | 8.6%
36998 | 26080 | 6.4%
25396 | 21536 | 5.2%
48281 | 20100 | 4.9%
30042 | 19996 | 4.9%
8092 | 19996 | 4.9%
46553 | 19996 | 4.9%
29964 | 19996 | 4.9%
33109 | 16668 | 4.1%
34193 | 16244 | 4.0%
Other values (113) | 98378 | 24.0%
(Missing) | 96288 | 23.4%

Value | Count | Frequency (%)
6311 | 6 | < 0.1%
8092 | 19996 | 4.9%
13621 | 16 | < 0.1%
14800 | 36 | < 0.1%
16058 | 2924 | 0.7%

Value | Count | Frequency (%)
105830 | 8 | < 0.1%
92584 | 1688 | 0.4%
90941 | 8 | < 0.1%
86039 | 168 | < 0.1%
80661 | 20 | < 0.1%

house_share_count
Real number (ℝ≥0)

MISSING
Distinct count: 97
Unique (%): < 0.1%
Missing: 96288
Missing (%): 23.4%
Infinite: 0
Infinite (%): 0.0%
Mean: 69.63779427
Minimum: 0
Maximum: 921
Zeros: 6
Zeros (%): < 0.1%
Memory size: 3.1 MiB

Quantile statistics

Minimum: 0
5-th percentile: 12
Q1: 25
Median: 49
Q3: 107
95-th percentile: 253
Maximum: 921
Range: 921
Interquartile range (IQR): 82

Descriptive statistics

Standard deviation: 69.94851481
Coefficient of variation (CV): 1.004461953
Kurtosis: 7.961986145
Mean: 69.63779427
Median Absolute Deviation (MAD): 47.96502806
Skewness: 2.507930452
Sum: 21895376
Variance: 4892.794723
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
14 | 36664 | 8.9%
107 | 35428 | 8.6%
49 | 26436 | 6.4%
37 | 21536 | 5.2%
123 | 20100 | 4.9%
31 | 20012 | 4.9%
25 | 20000 | 4.9%
59 | 19996 | 4.9%
12 | 16300 | 4.0%
71 | 9532 | 2.3%
Other values (87) | 88414 | 21.5%
(Missing) | 96288 | 23.4%

Value | Count | Frequency (%)
0 | 6 | < 0.1%
6 | 2924 | 0.7%
8 | 36 | < 0.1%
12 | 16300 | 4.0%
13 | 2168 | 0.5%

Value | Count | Frequency (%)
921 | 8 | < 0.1%
641 | 8 | < 0.1%
463 | 168 | < 0.1%
458 | 396 | 0.1%
395 | 1816 | 0.4%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
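
As a minimal sketch of this calculation (not the report's own implementation; the function name `pearson_r` and the sample data are illustrative assumptions), r can be computed with the standard library alone. The 1/n factors of the covariance and the standard deviations cancel, so plain sums are enough:

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of centered products; the common 1/n factor cancels in the ratio.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

x = [1, 2, 3, 4]
y = [2, 4, 6, 8]                      # exactly linear in x
shifted = [3 * v + 7 for v in y]      # change of location and scale

print(pearson_r(x, y))                # close to 1.0
print(pearson_r(x, shifted))          # unchanged by the shift and rescaling
print(pearson_r(x, y[::-1]))          # close to -1.0
```

The second call illustrates the invariance under changes of location and scale mentioned above.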

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
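
A minimal sketch of this calculation follows, assuming ties are given the average of the ranks they span (a common convention; the report does not state which one it uses). The helper names `average_ranks`, `pearson_r` and `spearman_rho` are illustrative, not part of any library:

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]        # monotonic but nonlinear in x

print(spearman_rho(x, y))      # 1.0: the ranks agree perfectly
print(pearson_r(x, y))         # below 1: the relationship is not linear
```

The cubic example shows why ρ picks up monotonic relationships that r understates.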

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
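
The pair-counting definition above can be sketched directly. This is the simple variant often called tau-a, written here with the illustrative name `kendall_tau_a`; it counts tied pairs as neither concordant nor discordant, whereas statistical libraries commonly apply a tie-adjusted variant, so results may differ from the report's when ties are present.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = 0
    for i, j in combinations(range(n), 2):
        # The sign of the product tells whether the pair is ordered the
        # same way in x and in y (concordant) or oppositely (discordant).
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]                    # one swapped pair out of six

print(kendall_tau_a(x, y))          # (5 - 1) / 6, about 0.667
print(kendall_tau_a(x, x))          # 1.0
```

The quadratic loop over all pairs is fine for illustration; for a dataset of this size a library routine would be the practical choice.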

Missing values

Sample

First rows

df_index | id | name | review_count | review_average | scrap_count | view_count | review_user_id | review_status | review_text | review_star_durability | review_star_design | review_star_cost | review_star_delivery | review_star_avg | product_info_id | production_info_name | production_info_brand_name | production_info_explain | production_info_is_purchased | house_residence | house_area | house_region | house_expertise | house_color_list | house_style_list | house_constructions | house_family_list | house_like_count | house_reply_count | house_scrap_count | house_view_count | house_share_count
02264792YLLEVAD 윌레바드 액자21x30,사진/포스터24.008822982947.0확인완료저렴하고 간편하게 사용 가능해 좋아요0.00.00.00.04.0264792.0YLLEVAD 윌레바드 액자21x30,사진/포스터이케아NaNFalse아파트30평경기도홈스타일링NaNNaNNaN['부모님과 함께 사는 집']14.02.0184.06311.00.0
14388715순수원목 A사이드테이블 3colors285744.53206101626351663207.0확인완료고양이 옹동이 관찰용인가요? 넘좋군요 새로운용도 발견...ㅎ 조립하는거 너무힘들었어요 드릴이 있는데도;; 조립부분 나사구멍 크기가 양쪽이 너무 차이가 나요5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
25388715순수원목 A사이드테이블 3colors285744.5320610162635779772.0확인완료유리라서 그런지 고양이가 매일 올라가있오요5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
36388715순수원목 A사이드테이블 3colors285744.53206101626352779603.0확인완료예뻐용!! 멍뭉이때매 예쁘게 꾸며놓지는 못해찌만,,,5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
47388715순수원목 A사이드테이블 3colors285744.53206101626353293434.0확인완료아이고 주인님이 좋아하신다니 정말 다행입니다 감사합니다5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
58388715순수원목 A사이드테이블 3colors285744.53206101626351232276.0확인완료고양이가 너무너무조아해요 유리에 식빵구울때 귀여움 레전드5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
69388715순수원목 A사이드테이블 3colors285744.53206101626351028829.0확인완료옛날부터 사고 싶었는데 품절이여서 못샀다가 이제야 사는데 집이랑도 너무 잘 어울리고 예쁜거 같아요 ㅎㅎ5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
710388715순수원목 A사이드테이블 3colors285744.53206101626352232708.0확인완료역시 인기있는 이유가 있네요 ㅜㅜㅎㅎ 튼튼하고 조립하기 쉽고 인테리어에 딱! 넘 예뻐요5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
811388715순수원목 A사이드테이블 3colors285744.53206101626351619522.0확인완료너무저렴해서 좀 걱정됬는데 리뷰가 좋아서 주문했는데 생각보다 엄청 튼튼하네요 깔끔하고 예뻐요5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0
912388715순수원목 A사이드테이블 3colors285744.5320610162635825616.0확인완료*장점*\r\n > 배송이 엄청 빠르고 설치가 쉽다!\r\n (받자마자 다리가 잉크얼룩이 있는걸 확인하고 교환신청했는데 \r\n 바로 다음날 도착하는 센스~)\r\n >생각보다 내구성이 좋다!\r\n (조립 완료해보니 흔들거림 하나도 없고 마감도 톱밥 날림 없이 매끈~)\r\n >이쁘고 가성비 짱짱이다!\r\n (사진으로 봐도 이쁘고 실물도 이쁘고 가격도 이쁘고~)\r\n*단점*\r\n > 좀 더 큰 사이즈가 있으면 좋겠다! \r\n (치수를 보고 구매한거지만 맥북프로 1.5배 면적이라 아쉬운 느낌?!)\r\n\r\n결론 = 단점이 없다....?! 너무 맘에 쏙들어서 강추함!5.05.05.05.05.0388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드True원룸&오피스텔8평경기도홈스타일링NaN미니멀&심플NaN['싱글라이프']82.023.0293.08092.014.0

Last rows

df_index | id | name | review_count | review_average | scrap_count | view_count | review_user_id | review_status | review_text | review_star_durability | review_star_design | review_star_cost | review_star_delivery | review_star_avg | product_info_id | production_info_name | production_info_brand_name | production_info_explain | production_info_is_purchased | house_residence | house_area | house_region | house_expertise | house_color_list | house_style_list | house_constructions | house_family_list | house_like_count | house_reply_count | house_scrap_count | house_view_count | house_share_count
410696413652388715순수원목 A사이드테이블 3colors285744.53206101626356284457.0확인완료생각보다 조금 커서 놀랐지만 귀엽구 예뻐요 그런데 조립하다가 제가 힘을 세게 준 건지 갑자기 부서지더라구요 ㅠㅠ 그리고 나사가 조이다보면 계속 뭉개져서 돌릴 수 없게 돼요 이건 제가 잘 못 돌리는 걸루...3.04.04.05.04.00388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410697413653388715순수원목 A사이드테이블 3colors285744.53206101626354048432.0확인완료가성비는 좋아요~ 이가격에 협탁을 살거라 생각 못했어요. 그런데 조립하는데 1번에서부터 나사가 잘 안들어가져서 사진상 유리 가장 안쪽부분 나무가 까딱까딱 움직여요. 엄청 스트레스 받다가.. 특별히 무거운거 올려둘일은 없을것같아 그냥 씁니다. 함께 구매한 조명등이랑 너무 잘 어울려 위안이 되네요3.05.05.05.04.50388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410698413654388715순수원목 A사이드테이블 3colors285744.53206101626351632368.0확인완료전 너무 좋아유 분위기두잇구 ㅎㅎㅎㅎㅎㅎ4.05.05.03.04.25388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410699413655388715순수원목 A사이드테이블 3colors285744.53206101626353500183.0확인완료깔끔하고 이뻐요ㅎㅎ\n가격대비 좋은 상품인거 같아요.4.05.05.05.04.75388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410700413656388715순수원목 A사이드테이블 3colors285744.53206101626354486808.0확인완료침대 옆에 두기 딱 좋은 사이즈에용 우드 살까 화이트 살까 고민했는데 화이트가 깔끔하고 더 화사해 보이네요!5.05.05.05.05.00388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 화이트TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410701413657388715순수원목 A사이드테이블 3colors285744.53206101626356122042.0확인완료튼튼하고 예뻐요\n침대옆에 두기 굿굿베리굿~5.05.05.05.05.00388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410702413658388715순수원목 A사이드테이블 3colors285744.53206101626356201606.0확인완료사이드테이블로 잘 사용중입니다 조립하는것도 쉬워요5.05.05.05.05.00388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410703413659388715순수원목 A사이드테이블 3colors285744.53206101626356723568.0확인완료마감처리가 별로에요. 다리부분에 찍힌 자국도 많고 유리가 안들어가길래 봤더니 나무가 제대로 제거 안됐더라고요. 칼로 열심히 긁어내서 겨우겨우 꼈네요. 짜증은 났지만 교환하기 귀찮아서 써요2.02.03.05.03.00388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 화이트TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410704413660388715순수원목 A사이드테이블 3colors285744.53206101626355816478.0확인완료다 설치하고 유리 낄려고 보니깐 저렇게 다 깨져있네요 전화도 안받으시고 어떻게 좀 해주세요1.01.01.01.01.00388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 우드TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
410705413661388715순수원목 A사이드테이블 3colors285744.53206101626357101314.0확인완료조립 매우쉽고 이뻐요 강추입니다~~~5.05.05.05.05.00388715.0순수원목 A사이드테이블 3colors먼데이하우스상품명: A사이드테이블 / 색상: 화이트TrueNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN